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ABSTRACT 



PROCESSING SYSTEMS OPTIMIZATION THROUGH AUTOMATIC DESIGN AND 

REORGMIZATION OF PROGRAM MODULES 

J.F. Nunamaker, Jr., W.C. Nylin, Jr., and Benn Konsynski 



A methodoloqy is described for the automatic design of a 
processing system initially defined in terms of logical pro- 
cesses or program modules. Processes and files are grouped and 
reorganized in such a way as to produce an optimal design with 
respect to a specific target machine. Performance criteria 
for the optimal design is defined in terms of transport volume 
savings and core memory requirements. 

Starting with a graph theoretic representation of ^ the inter- 
action between processes (or modules) and files, the methodology 
consists of two components: (1) a generator of feasible alteirna- 
tives and (2) a procedure for reorganization and code generation 
for specific groupings. The generator for the feasible alterna* 
tives uses an implicit enumeration algorithm to optimize protsess 
groupings in an efficient manner. The objective is to group 
processes into modules which minimize the interaction between 
modules while still satisfying the logical requirements of the 
program and the physical constraints of the hardware. Finally* 
after the program modules have been specified, program and fil© 
reorganization will be performed to further optimize the deSi^fi. 
Reorganization includes the combination of similar data passes 
on the same file to minimize transport volume and the metqinq 
of loops to enable elimination of code and of intermediate 
data files. 

The code generator will then accept the optimal program 
design and produce an optimized source language program tot 
the target machine. Consequently, not only can an optimal 
design for the processing system be generated; but due to 
reorganization techniques, the resultant modules (defined 
from specific process groupings) may approach the computational 
efficiency expected of an integrated program. 
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I. 



INTRODUCTION 



It is recognized that perhaps the single most important problem 
which faces a computer user is that of conversion of programs to 
another machine. This is true even for programs written in "machine 
independent", high-level source languages. Changes in the system 
configuration; e.g., hardware, operating system, or file structure, 
may have altered the operating environment significantly so that the 
programs no longer take advantage of the strength of the configuration. 

For whatever reasons that make the conversion necessary, such as 
the replacement of an obsolete machine or the requirement to run the 
program on additional machines, the situation is applicable to many 
users. It is also recognized that many users have considered autO" 
matic conversion of computer programs with techniques such as emula- 
tion and simulation. However, very few users have seriously attacked 
the problem of the optimal reorganization and design of the program 
when moving it from one machine to another. 

Many users are of the opinion that anything less than 100% 
automatic conversion is not worth considering; however, it can be 
Stated emphatically that less-than-complete conversion tools are 
useful and the redesign and reorganisation are neeessary for effi« 
cient operation of the resulting program modules. 

The transferability problem toucheg on all aspeets of softwaie 
design; specific methodology from decompiling^ graph models of pro- 
grams, operations research search techniques, and problem statement 
languages are used to form an approach to the problem. 
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II. METHODOLOGY 

What is needed is a methodology for converting, redesigning, and 
reorganizing programs from one machine to another as a result of 
stated performance criteria. 

This paper discusses a software system for the design and re- 
organization of computer programs, and a methodology is described 
^for"_the automatic design of a processing system initially defined 
in, terms of logical processes or program modules. Processes and 
files are grouped and reorganized in such a way as to produce an 
optimal design with respect to a specific target machine. Perform- 
ance criteria for the optimal design are defined in terms of transport 
Volume savings, core -memory requirements, and input/output requirements . 

Transport volume of a system is a measure of performance that 
is related to total processing time. Processing time is a non- 
decreasing function of transport volume; therefore, it is desirable 
to decrease the transport volume of a set of program modules. It 
was shown by Nunamaker [1,2] that there exists a class of process 
groupings which result in a reduction of transport volume when two 
or more processes are grouped. Using a simple case as an example, 
the transport volume is reduced when two processes are grouped if 
th% output of one is the input to the other process. As a result 
of the grouping of the processes into a composite program module, 
the core requirement will be increased and the input/output require- 
ments of the system may "be affected. 

In this paper the assujnption is made that we are starting with 
a well-defined problem, and that the set of processes can be deseribed 
in terms of a directed graph. In addition, we know other information 
such as frequency, volumes, etc. 

Building on the graph theory representation of the interaction 
between processes (or modules) and files, the methodology consists Of 
two components s (1) a generator of feasible alternatives and (2) a 
procedure for reorganisation and code generation for specific gfOUpiftfS. 
fhe generator for the feasible alternatives uses an implicit eniimefa* 
tioft algorithm to generate alternative groupings in an efficient man- 
mis* The objective is to group processeg into modules which minimize 
Yr- the interaction between module p^while still satisfying the logical 

^ 6 



NUt^AMAKER, Program Module Design • 3 

requirements of the program and the physical constraints of the hard- 
ware. Finally, after the program modules have been specified, program 
and file reorganization will be performed to further optimize the 
design. Reorganization includes the combination of similar data passes 
on the same file to minimize transport volume and the merging of loops 
to enable elimination of code and intermediate data files. 

The Program Module Generator presents a very large set of feasible 
program modules for the target machines; it is the task of the selec- 
tion algorithm and the reorganizer to construct a reasonably good set 
of program modules. The code generator then accepts the optimal 
program design and creates the optimized physical code for the target 
machine. Thus, the Program Module Generator chooses from among all 
conceivable combinations of processes for program modules and .'•selects 
the "best" design after considerable interaction with the program 
reorganizer. 

Consequently, not only can an optimal design for the processing 
system be generated, but due to reorganization techniques, the re- 
sultant modules (defined from specific process groupings) may approach 
the computational efficiency expected of an integrated program. 

An overview of the methodology for the automatic design and re- 
organization of program modules is shown in Figure 1. The specific 
subject of this paper begins with the assumption that a Problem 
Definition exists and is shown below the dotted line in Figure 1. 
The problem definition, generation, and translation into a problem 
statement is the .subject of another paper [2] . 

III. DEFINITIONS 

A programming system, P£ = (PR,F,T,E) , is defined as a set of 
processes (PR), files (P) , the control flow of the files (T), and the 
relationships of the set of processes and files (E) . These defini- 
tions are extensions of the work of Langefors [3] and Briggs [4]. 

pr - Process — A well-defined task representing a pass over ©fte 

or more data files; where PR* (pr-^, .pr^^, P^f^'i * 

f - File--The data input or output of a process; where 
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T - Control Plow Matrix--The control flow precedence relationship 
of the prog r aiming system. 

t. . = 1 if control flow can pass directly from f. to f . . 

t . . = 0 otherwise. 
ID 

E - Incidence Matrix — Processes and files, 

e . . = 1 if f . is an input to pr . . 
13 3 . ^ 

e^j = -1 if f j is an output of pr^. 

e. . = 0 if there is no incidence between f . and pr . . 
ID D 1 

The control flow of a network is described by T, and the data 
relationship of the processes and files in a network is described 
by E. 

From the Incidence Matrix we can define the concept of transport 
volume. Transport volume is one component of the performance criteria 
which are used to evaluate alternative program module design. Per- 
formance criteria for program module design is a function of the fol.- 
lowing components: (1) processing time, (2) transport volume, (3) core 
size, and (4) the number and type of input/output units required. 

Let V, be the volume of file f^;. l^, the number of logical inputs 
and outputs of pr^; and mp^, the multiplicity of file transport for f ^ . 
Thus mp . represents the number of times f . is an input or output of a 
set of processes; cm^ represents the core memory required by pr^. 



^ j«l 



inp^ = £ ' D = 1/2,.. .,k. 

3 i-1 

The transport volume for f^ is.; 

tv . - mp . • v . . 

D ^D D 



The transport volume for the set of data files is: 
TV = tv.. 

Let pr^ be represented by a [ | and f ^ by a Q 
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The Incidence Matrix (E) and the associated incidence graph for a 
prograinraing system of six processes and ten files is shown in Figure 2 
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Figure 2. Incidence Graph and Matrix- E for a prograitttning systen 
of 6 processes and 10 files. 
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This example is also used later in Section VII* 

We now must define additional matrices needed for the grouping 

procedure. 

10 
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The E matrix of processes and files is used to generate the data 
flow Precedence Matrix of processes P. Note that a distinction is 
made between the control flow T of the programming system and the 
precedence relationship of. the processes with respect to data flow. 

p - Precedence Matrix; Processes 

p^j = 1 if pr^ is a direct precedent of pr^. 

p. . = 0 if otherwise. 

P can be reconstructed from E as follows: 

p^j = 1 if and only if 3 A 3 e^^^ = -1 and e^^^ = 1. 

The Precedence Matrix (P) of processes for the example of Figure 2 
is shown in Figure 3. 
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Figure 3. Precedence Matrix of Processes P. 
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The R*, and G matrices are .generated for the entire set of 
Processes . 

R - Reachability Matrix; Processes 

The R matrix is used to check precedence violations in th@ 
grouping procedure 

R « P V P^V. . . .V P^*-^ 
where q is the index of the nilpotent matrix P? i.e.^ when 

r^j = 1 if pr^^ has any precedence relationship with pSy 



tAA^O otherwise. 

3. J 
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The Reachability Matrix (R) of Processes for the example of 
Figure 2 is shovm in Figure 4 . 
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Figure 4. Reachability Matrix of Processes R. 

R* - Partial Reachability Matrix; Processes 
The R* matrix is used to calculate the G Matrix. 
R* = p2 V P^ V. . . .V P^"-'- 

r*. . = 1 if pr. has a higher (2 or more) order precedence 

with pr . . 
D 

^*ij * ^ otherwise. 

The Partial Reachability Matrix (R*) of Processes for the exampl< 
of Figure 2 is shown in Figure 5. 
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Figure S. Partial Reaehability Matrix of Processes S*. 
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q - Feasible Process Pairs Grouping Matrix: Processes 
If g^j ss -1, there exists higher (2 or more) order relationships 
between pr^ and pr^; and pr^ cannot be combined with pr^. If g^^ « 
there is no precedence ordering; and pr^ can be combined with P^y 
This indicates a feasible but not necessarily profitable grouping. If 
^ij ~ ^' either a direct precedence relationship, and pr^^ can 

and should be combined with pr . since this indicates a feasible and 
profitable grouping; or there is an immediate reduction in logical 
input/output requirements when pr^^.and pr^ are grouped. 

g. . = -1 if r*. . or r* . . = 1 or i«j . 

g. . ss 0 if r*. . =0 and r*.. ~ 0 and p . . « 0 and p.. « 0; except 
i]3 ij jJ- ij ji 

when (Pj_j^=l and Pj^=l) or (Pj^j_=l and Pj^j*l) • 
g, , = 1 if r* . . = 0 and r*.. « 0 and (p^ 4*1) or (p..*!) or 

^^iC^ and Pj^^D or (Pj^j~l and Pj^i=l)j • 

pr^^ has a first order precedence or succedence rela- 
• tionship with pr^ and pr^. 

The Feasible Process Pairs Grouping Matrix (G) for the example 
of Figure 2 is shown in Figure 6. 
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Figure 6. feasible Process Pairs Grouping M^tfix G. 
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A list of all feasible pairs for grouping of processes is e'©n« 
stfueted from the G Matrix and passed to tiie generator of alternative 
' ffdupings. 

13 
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It is known that by grouping processes into a compositG process 

called a program module, the multiple input and output of files can be 

reduced. A program module PM^ is created by combining and reorganizing 

processes pr. , pr^ pr^.. Let Lq be the source language; L^, 

12k 

the inte. mediate language; and the target language. Notrn that a 
situcition m-y exist in which a single language could serve ar. Lq/ L^^, 
and 

Define procedure RG which maps processes in language Lq into re- 
organized modules in language L^^. RG performs thR following tasks: 

1. Conversion of the individual processes from Lq to L^^ if Lq ^ L^^. 

2. Transformation of the processes written in L^^ into a syntactically 
and logically correct module in L2. 

3. Reorganization of the module. 

Thus, one can d'efine a program module PM^ using RG. 

PM. a RG(pr . , . . . . , pr . ) is a feasible program module if 
^ ^1 ^k 

or e PR,l<i<k, and PM. satisfies all the constraints for a valid 

subprogram in language L for the target machine. Thus, a feasible 
program module must satisfy the core memory and logical constraints o€ 
the program. 

M. = {pr . , pr. , . . . . , pr^ } is a feasible grouping if PM, is a 
^ ^1 ^2 ^k 

feasible program module. 

6 = {M^, Mg}is a cover fdr the programming system fB it 

i) M^, lliiq is a feasible grouping. 

ii) Mj_U U. . .U « PH. 

iii) M^^Mj * 0 for i ?^ j V j<q, j<i, i<q. 

Note that PM^ « RG(prj^) must be a feasible moduls fosf l<iin io^ 
a eever for PS to exist. We can now define the set ©f all possibl© 
eevers (A) for PS. (i.e.^ A « {6|6 is a cover of PS).) 

14 
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IV. PROCESS GROUPING CONCEPT 

. In generating an efficient design, it is necessary to decrease 
the transport volume (total number of characters read in and written 
out of main memory) in order to reduce the processing time. If file 
volumes remain constant, in order to decrease the transport volume, 
the multiplicity (the number of times a file is input and output) of 
file transport must be decreased. After the Program Modules are 
specified, the files are consolidated for the purpo of reducing the 
number of input/output files required and for better utilization of 
storage in auxiliary memory. Process grouping is shown to correspond 
to a grouping of rows of the Incidence Matrix, and file consolidation 
is shown to correspond to a grouping of coluiuns. 

Program module design is concerned with the reduction of pro- 
cessing time and can be summarized by the two methods by which the 
processing time can be reduced. The generator of alternatives deter-, 
mines which operations (Processes) will be grouped into Program Modules 

1. Group Processes which eliminate the writing out and the reading 
in of a file. Consider the example in which the output of oniS 
process is the input to another process, as shown in Figure 7. 




Pigu 



re 7. The output if pr* is the input to pr^, 



The transpor 
as shown in F: :ure 8". 



t volume of £ is eliminated when pr^ and pr^ are grouped 



(Ty^ A,B — »^(b) 
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Figure 8. Grouping of pf^ and pr^ ot Piguf© 7 
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2. Group Processes which require the same file as shown in Figure 9. 




Figure 9. pr^ and pr^j read a common input file f^. 

The transport volume is reduced when pr^ and pr^ are grouped since 
is read only once as shown in Figure 10. 



0-* 



Figure 10. Grouping of pr^ and pr^ of Figure 9. 

It may be profitable also to group f^ and as shown in Figure ll# 
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Figure 11. Grouping of f^ and fj^ of Figure 10. 

The objective is to reduce total transport volume and thus total 
processing time. 

The concept of process grouping is illustrated with the example 
from section VII of the paper and the inoidence Matrix and graph for 
the example is shown in Figure 2. The transport volume TV for the 
example is 350 units, the core memory required is 30 units, and the 
maximum input/output requirement for any proeess is 3. The crueial 
items in determining which processes to group into modules are ths 
transport volume and main memory size* It is desirable to produee 
modules (subjeet to the memory constraint) minimizinf the tfansgoft 
volume for the programming system. ^ 
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Consider as an alternative design combining pr^^f P^b' ^*^C 
into one program module and pr^, P^e' ^^P ^ second program 

module. The resulting transport volume TV is 270, the core memory 
will be no larger than 55 units, and the maximum number of input/ 
output processes for each program module has increased to 5. This 
alternative design is shown in Figure 12. 
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Figure 12. Grouping Processes of Figure 2 into two Program Modules. 

The transport volume for the example of Figure 2 has been redueed 
from 350 units to 270 units, the core requirement has increased fifom 
30 units to 55 units for program module size and the maximum input/ 
output requirement has increased from 3 to 5* 

Several assumptions are made in the graphical repxfesentation @f 
the processes and files. Contifol flow over a file is assumed t& eniat 
for any process. Multiple data passes over an input and output file 
as illustrated in Figure 13 are shown as follows In Figute 14 iof the 
eomputation of transpo^^t volume. 
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Figure 13. Multiple data passes over an input file and output file. 



Figure 14. Graphical representation of the examples of Figure 13 for 
the purpose of computing transport volume. 

In other words, an element of file "a" is read and au element of 
file "b" is written. Control then reads the next element cf "a" and. 
writes the next element of "b". Therefote, cycles within a process 
are not shown in the Incidence Graph, but are assumed to exist in the 
Incidence Graph and are shown in the control flow graph. 

In addition, the situation may exist in which control flow actually 
passes completely from file "a" to file "b". This is the case when the 
entire file "a" is read before file "b" is written. For example, the 
process may involve a sort on file "a", or a complete read of file "a" 
may be required to compute various sums that are dependent on the con* 
tent of file "a". 

Both cases are represented as having the same incidence matrix 
•for purposes of computing the transport volume. The T or contifol flow 
matrix reflects occurrences of multiple data passes over a file* 

The P matrix and E matrix are the same for both cases and the T 
matrix is different. This is illustratad as follows in Figure IS* 

It can be noted that although the transport volume for specifie 
files has been eliminated by the grouping, storage (in main memory) 
fcr the information contained in those files is still necessary* It 
is the purpose of reorganization to automatically restructure the 
GOfflbined processes towards the reduction of the number of loops over 
that information and thus possibly eliminate the need to maintain it 
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between two data passes. This can be accomplished by restructuring 
the loops (consecutive data passes) in an attempt to provide only one 
loop over the original data file. Consequently, only a specific ifecord 
of that information file may need to exist for each pass through tM 
merged loop* That is, the information in that record could be computed 
and used by the same pass, and no longer be necessary upon completion 
of that pass. Such is the case in the example presented in Figure 2 
and described in a later section in which the combination and reotgani- 
nation of processes E and F allow for the elimination of the Customer 

Transaction File £". 

A record of the Warehouse Transaction File g was used to eompute 
each record in the Customer Transaction Pile Similarly, a reeord 
of £" was used to compute each record in. the Customer Order Pile @* 
and the Customer Payment Pile e". By grouping and reorganiging two 
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processes, E and F, the individual records for e' and e" can be com- 
puted from a record of f"" (which was just computed from a record of y) . 
Consequently, only the storage for a record of f" is necessary in 
comparison to storage for the entire file. In addition, it may be 
possible to eliir.inate the record of f" and compute those for e' and e" 
directly from g. Such a procedure requires the following subtasks : 

1. Combining processes into logically and syntactically correct 
modules., 

2. Comprehensive control and data flow analysis. 

3. Restructuring the module to" merge data passes (or loops.). 

4. Elimination of unnecessary files (or data). 

V. PROCESS GROUPING DETERMINATION 

The problem of assigning the various processes to modules is a 
complex combinatorial problem complicated by the fact that savings 
from reorganization cannot be predetermined. However, prediction of 
savings in transport volume which result from process groupings can 
be accomplished. 

The G Matrix relates the profitable binary groupings; i.e., those 
in which a savings in transport volume is incurred. 

An interaction matrix is created to reflect the transport volume 
savings encountered in a binary grouping of processes.. This savings 
matrix or S matrix is computed as follows: 



ii - Z ^] 



S. - = S 
ID J 

k=l 



1 if e.^ « 1 and e.,^ = 1. 



2 if ie^y. = -1 and e^j^ = l)or(e^j^ « 1 and e^j^ « -i 
0 otherwise. 

Where k is the file subscript' of the Incidence Matrix. 

If the binary grouping is immediately infeasible, we maintain the 
potential savings we would encounter if the grouping is made at a later 
time . 

The TV savings which result from a grouping of n processes is 
then reflected in the sum of the savings of binary grov-^ings of the n 
processes. (NOTE? There are n(n->iy binary oombinations of prooesses.) 

Thus, our S matrix is presented in Figure 16 for the example problem 

o given in Figure 2. 
ERIC 
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0 




C 


0 


0 
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Figure 16. Matrix of transport savings for pairwise groupings 
for the example of Figure 2. 
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TVS is the Transport Volume Savings function for any grouping M. 

TVS (M) 



■> 



S. . 



pr.eM, pr.cM 

i J 



Thus, from above: 
TVS (DEP) = S 



^„ + S^„ + S^„ 
DE EP DP 



i<j 

« 0 + 40 + 20 = 60 



TVS (ABD) = S^g + Sgj^ + S^jj « 20 + 0 + 100 = 120 

Note that this short-cut method for computing Transport Volume 
Savings (TVS) is based on the assumption that a file is input to no 
more than two processes. If a file is input to three or more processes, 
the Transport Volume Savings m.ust be computed directly from the Inci- 
dence Graph of the proposed design. 

Transport Volume Savings is supplemented by gains occurring as a 
result of reorganization. 

The objective is to minimize transport volume subject to e&£^ 
constraints and feasible grouping consideration. This may be stated 
as follows: 

If C(M.) is the core requirement for module RG(M. ) and CT is the 
i * 

eore constraint/ 

C(Mj^) < CT , 1 < i < q 

It is important to note the interaction effddts eneountered afs 
of a non-convex nature; i.e., there exiit looal optima which af© not 
necessarily global optima; thus^ to guarantee a global optimum , the 
entire solution space must be considered. 
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The grouping procedure was first formulated as a Quadratic 
Assignment Algorithm [5] in order to implicitly truncate unprofitable 
solutions from the solution space and facilitate speed of convergence 
to a good solution. However, optimality cannot be guaranteed for 
this problem as we formulated it using the Quadratic Assignment 
Algorithm. The process grouping procedure described in the next 
section is formulated as a straightforward enumeration scheme. The 
number of feasible designs is not too large for problems of 50 
processes or less whe.. je memory and precedence relationships are 
used as constraints on Module size. 

A. GENERATION OF FEASIBLE PROCESS GROUPINGS TO FORM MODULES 

The G matrix is used to create a listing of binary groupings. 
The binary pairs (i,j.) of the upper triangular portion of the matrix 
are selected if g . . = 1 and it is not true "that 3 fi- 3 Pj . and r,. « !• 

The consequence of allowing the (i,j) pair into the pair list given the 
above condition is the generation of infeasible groupings. Consider 
the following precedence graph of processes as shown in Figure 17. 



\7 



Figure 17. Precedence graph of processes* 

We observe that p^g = 1 and r^^ « 1 hold? thus, AC is eliminated from 
the pair list; otherwise {AC,B} appears to be a feasible design, whieh 
it is not. By eliminating the pair {AC) from the feasible list of 
pairs, we create only feasible designs.- (NOTE: {ABC} is still a 
feasible grouping.) 

Once a list of feasible pairs is created, feasible modules of 
size 3, n, are generated by the following algorithm. 

A list V of feasible #groups of si2e k+1 is generated using the 
list Y of groups of size k by selecting an element of Y and comparing it 
with ©very element of the list of pairs and adding a process t© the 
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grouping if that process is in a pair with.,an element of the group and • 
no output of any process of the element of Y is input, or reachable as 
input, to the process to be added to the grouping. This means that if 
Y e Y and pr^ ^ y then y U pr^ is a feasible grouping if 3- pr ^ c y 3 

g^^ « 1 and for all prj^ e y if 3- pr^^ i y and ijH then Pj^^^ = 1 implies 
r^^^ 1 and p^j^ = 1 implies r^^ / 1. 

This is to say that a process is added to the grouping if there 
is no output from a process of the group which is indirect input to 
the candidate process; i.e., there is no process not in the grouping 
which accepts output from the group and generates input to the candi- 
date process. An example is shown in Figure 18 of the problems that 



A 



B 



D 



Figure 18. Illustration of Potential Grouping Problem, 
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can arise in the grouping procedure. If (A B C} is a grouping, then 
we see that the grouping {A B C D} violates our rules for acceptance 
of the candidate D for Pg^^l and r^^j^^l and yet E {A B C} and E^D. 
Thus {A B C D} is not a feasible grouping. We can see, however, that 
{A B C E} and {A B C D E} are feasible groupings. 

From the list of feasible pairs, all feasible triples are gene- 
rated. T^o pairs are grouped if they have one process in coflunoA. ftom 

valid pairs and triples, designs of size 4. , n, are generated. 

This procedure generates modules from size 2 up to n where n is 
the number of elements of PR. The result is a list of all possible 
feasible process groupings M, with the exception of the individual 
processes. Thus, in the example we call this listing S, and 
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The set Y^"**^ of feasible modules 
is initially ompty. 

^+1 

Create process groupings r 
of size m+1 from groupings \ 
of size m and the list of 
feasible process pairts. 




^ pairs 
considered^ for y? 



pr.; pr 



select a 


pair 


o£ 


processes 






from the 


list 


of 


feasible process 


pairs 



If only one process 
of the proceijs pair 
belongs to the m 
size grouping y, 
form a m+1 size 
grouping y • . Add 
pr i y. 



Process grouping y 
is a possible program 

module • 



If y' is already a member of 

v'"'*'^ then create another 
program modulo • 



Check core constraint for 
progreun module y*« 



l8 there a precedence 
violation v/ith respect to the 
m size process grouping y And 
the candidate process pr* ? 
if a precedence violation 
exists # then reject the 
grouping y\ where y'^^y U prj^ 

Add the process grouping y* 
to the set of feasible 



program modules V^"**^* 




no 



yes 



Figure 19. Generation of Feasible Process Groupings To Form Modules. 
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The set of feasible syEstcms designs 
a"*"*"^ is initially empty. 

Create groupings o"'^'^ 
of size m+1 from groupings 



m+1 



e*" of size m. 



I 



Select: o"' c a"' 



yes 




all b G B >^ Select processing 
congidered>^ grouping (module) 

for e";? ^ I- 



b from B 



If candidate module b has 
a process in common with 9' 
reject the candidate grouping 

e"^ u b. 



Module grouping 0 is a 
possible programming systems 
design* 



no 




o"^^^ - o"^ U b 



If 9 
m+1 



m+1 



is already a memb.or of 



a' ~ then create another grouping 
of modules to form a new systems 
design. 



Is there a precedence violation 
with respect to the m size module 

grouping 6^^ and the candidate 
module b? If a precedence 
violation exists , then reject the 

q"" U b. 



grouping e"^"^^, where e"^^^ 



Add the module grouping Q^^^ 
to the set of feasible systems 

designs a"^^^. 




^m+1 ^ .m+1 u gm+1 



Figure 20$ Generation oj! Alternative System Designs 
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B « 



{AB) 
{AD} 

{EF} 
{ABC} 

{def} 
{abcd} 



{ABC def} 

the list of all feasible groupings M is the union of the set B and the 
set PR or M = B U PR. See the flow chart of Figure 19. 

B. GENERATION OF ALTERNATIVE SYSTEM DESIGNS 



A similar procedure is used to select all feasible combinations 
of modules (process groupings) that form a cover for the programming 
system PS. Note that all elements of B satisfy the system's core 
constraint so modules may be combined freely so long as no modules 
in the cover have any common processes. The core constraint of a 
system is satisfied as long as the core requirement of the largest 
program module in the design does not exceed the 'constraint. 



m 



The proce.lure seeks to generate sets (a''"}, m = 1, s^ where 

a is a set that contains all possible combinations of m modules and 
s £ n is the maximum number of modules that can be combined. Clearly, 
a 



' = B. 



Proceeding by inductive definition, a^"^^ is generated from a^. 



To generate a"^"^^, for each element e"^ e a"^, 9^^ is combined with any 
element b g B for which b has no process in common with the modules 
.m .m-rx ^ ^ . , .m-x ^j^^^ proeeduifa 



Then, d^"^^ s e"^ U b becomes an element of a"^*^^ 



of 9' 

is shown in Figure 20. 

Further, as above, there must be only direct precedence between 
the set e"^ and the candidate module b; i.e., the case, as illustrated 
in Figure 21 does not occur. 



gm 




Module 
b 
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Figure 21. Illegal Grouping Situation* 
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Thus, for example, if 

{BC,AD} e and {EP) e 3 and {BC,AD} ^ {EF} = <\> 

Then {BC,AD,EF} e a^+^ = a^. 
Further, if 

{ABC) e and {EP) e 6 and {ABC} {EP} = (|) 
Vhen {ABC,EF} e a^"^^ = 
Note in the first case a cover of PS is formed. 

After all combinations have been enumerated, a cover 6 is formed 

from each combination 9 e a as follows: We let 8 e a = {M^^, 

M, } , where M. e B and feasible module M. = {pr. , pr. }. M. 

k ' 1 1 1 

contains n^ processes. 

Then the residual processes can be defined as: 



n . 



b' = |{pr}|pr e [PR- (M^)]l; thus 6 = 8^ U b'. 

I ^"^1 mm 

In other words, a feasible design is any element 0 of any. ot 

combined with all ungrouped processes as individual modules. For 

example, 6 = {ABCD, E, F} is a feasible design for the programming 

system diagrammed in Figure 2. 



C. TitANSPORT VOLUME SAVINGS CALCULATION 



The transport volume savings - for any design can be calculated by 
examining each -module of the design as stated earlier and the savings 
for the design is the sum of the savings for any modules. Thus, if 
6 - {M,, M,, M }, then the transport volume savings for 6 is 

TVS (6) => 2 S .j^ 

. j<k 

The optimal design with respect to transport volume savings can bi 
designated as follows: 

• = MAX ( TVS (6) ) 

6 
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The core requirements for a design can be designated as the maximum 
requirements for any module of the design; thus, it is 



Ma 



ximum [ C{H^) , C{n^) , , ,C(Mg)] . 



The list of designs are then sorted in ascending order by core and 
descending order by transport volume savings and the x best designs 
(highest transprrt volume savings) are saved for each range of core? 
i.e., top ten for range 20K-'30K bytes of memory, top ten for range 
31K~4 0K bytes of memory, etc. The core constraint may be as a result 
of a partition size or an arbitrary constraint on module size. The 
number of designs (x) to save for each range of core memory is arbitrary. 

VI. COMBINING PROCESSES 



The combining of processes in Lq into program modules in re- 
quires the translation of those processes into Lj^, This translation 
can be made either before or after the processes are cdmbined. How- 
ever, if the translation is made first, then the procedures for com- 
bining program modules are over tho same language as the reorganiza- 
tion procedures. This would enable new processes to be addod to 
already reorganized module'.?. Procedures necessary for combining modules 
include those to resolve identifier conflicts, interface the modulCJ3 
with respect to external files, and perform structural modif icatidns 
necessary to produce the desired syntactically and logically correct 
module . 

Nylin [6] has discussed techniques which have been used for the 
implementation of these procedures. In addition, other techniques 
can be used to take advantage of any commonality which may exist 
betw(sen the modules being combined. Yershov [7,8] described techniques 
to efficiently utilize storage by allowing certain variables to use the 
same memory location. Similarly, algorithms exist for detecting eommdft 
data storage areas and to eliminate redundant procedure definitions (61. 

Much of the control flow information necessary for reorganiaatieft 
can be accomplished with existing techniques such as analysis using 
Boolean connectivity matrices first described by Prosser m, ot the 
interval method described by Cocke and Schwartz [10] and Allen (111. 
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Techniques for additional control flow information which can be used in 
reorganization are described by Nylin [6] . Existing techniques [12, 13, 
14] can be used to compute variable "usage" information utilizing the 
' data gathered in the control flow analysis. 

The control flow and data flow infcrination procedures are neces- 
sary to locate loops (data passes) which may be candidates to bo merged. 
Of particular interest are loop pairs in which one loop is always 
executed the same number of times as a specific branch of the other. 
Thus, one loop computes the control parameters for the other. If this 
can bo determined and ii: additional data flow information allows the 
loops to be merged, one of the loops can be eliminated and its body 
moved into the other loop. This procedure may include the replacement 
of an induction variable as well as the redefinition of certain program 
variables necessary as a result of merging the loops. It should be 
noted that the complexity of the control flgw within either loop is not 
a factor in the ability to merge them. 

Once the loop& are merged, it may be possible to apply subsump- 
tion techniques [6,13] to eliminate unnecessary stores into variables. 
That is, the definition of a particular variable (which could be the 
internal representation of a data file) may be able to replace subse- 
quent occurrences of that variable. Thus, due to the merging of two 
loops (data passes) , it is no longer necessary to maintain information 
. which is only utilized by a specific pass through the merged loop. 
Hence, an intermediate file (used for communication between the loops) 
can be eliminated. ; 

Once a decision !ls made as to which processes are to orr.i a module, 
they must be integrated by some automatic procedure. This combining 
of specific processes to form a program module that can be executed on 
the target machine involves several steps. First, the processes must 
be able to be represented in an intermediate language that has the 
following attributes: 

1. An ability to measure the memory required to implement the 

processes on the target machine. 

2. The ability to automatically analyze the grouped processes and 
perform reorganisation procedures on their loop and file st^ueture* 

3. The ability to map thR grouped processes into the desired programs 
^ on the target machine preserving their reorganized structure. 
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It should be noted l.hat the intermediate language (Problom Statement 
Language) could be either the original language or the desired language 
for the target machine. It may even be desirable to compile the final 
grouping in the intermediate language to produce object code directly. 
This could be especially advantageoufs if the processes are described 
by a high-level problem statement language. That is, there exists no 
need for any other level of documentation since new modifications to 
the system would be made at the problem statement level. Thus, when 
changes are made to the existing processes or new processes are added, 
the final modular progran-unmg system can be automatically regenerated. 

Clearly, this is necessary to guarantee that the system remains 
optimal and that no errors are introduced by adding code to a module 
consisting of multiple processes. 

Once the processes are represented in the intermediate language 
L^, they must be combined into a logically and syntactically correct 
program module for L^. The program modules (processes represented in 
L^) can be automatically combined to form larger modules in L^. These 
multipass modules could be automatically analyzed; and, if possible, 
reorganized to combine multiple passes over the same file. In addi- 
tion, in some cases the file could also be eliminated. The elimination 
of such files not only increases the efficiency of the resultant module 
but it decreases the laemory it requires [6] . 

In addition to the directed graph representation of the set of 
programs, the following information is assumed to be available for 
module reorganization. 

?. Process documentation 4. Pile usage 

2. Source deck or list of processes 5. Input and output test data 

3. Operating instructions 6. Frequency of process cyclcfJ 

VII . EXAMPLE 

The example below is a system of processes which creates a 
warehouse shipping schedule. 

The input is considered to be a transaction file containing 
Receiving Reports, Customer Orders, and Customer Payments. The 
transactions are divided into a receiving file and a customer 
* transaction file. 

The receiving reports are used to update the inventeify on hand 
file, while the customer orders and payments are separated and a 
payment sumir\ary produced. 
Q The Incidenee Matrix for this example is shown in figure 2« 
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PROCESSES 

A - Shipping Schedule Generator 

B " Order file sorting for Scheduler 

C - Customer Payment Summary Generator 

D - Inventory Update 

E - Separate Customer Payments from Customer Orders 

F - Separate Receiving Report from Customer Transactions 

FILES 

a - Shipping Schedule Report 

b - Customer orders sorted by item 

c - Customer Payment Summary 

d - Updated Inventory 

e' - Customer Orders 

e" - Customer Payments 

f - Receiving Report 

f" - Customer Transaction 

g - Warehouse Transaction 

h - Old Inventory Master 



The files e', e" , and f are described below: 



RECEIVING REPORT (f ' ) 

Columns Data 

1 - 2 'RV 

3 ~ 7 Vendor Number 

8-27 Vendor Name 

28 - 47 Vendor Address 

48 - 55 Value of Goods 

56 - 60 Component Number 

61 - 65 Quantity Received 

66 - 71 Date Received 

72 - 77 Blank 

78 - 79 Warehouse 



CUSTOMER ORDER (e' ) 

Columns Data 

1-2 'C0' 

3-7 Customer Number 

8-27 Customer Name 

28 - 47 Customer Address 

48 - 55 Value of Goods 

56 - 60 Component Number 

61-65 Quantity Received 

66 - 71 delivery Date 

72-77 Order Number 



CUSTOMER PAYMENT (e") 



Columns 


Data 


1-2 


'CP' 


3-7 


Customer Number 


8 - 27 


Customer Name 


28 - 47 


Customer Address 


48 - SS 


Amount Paid 


56 - 71 


Blank 


72 - 77 


Order Number 



P, R*^ and G Matrices for the above example are given in Pifures 
^ 3/4^5/ and 6 respectively. 
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The transport volume savings Matrix S is given in Figure 16; thus, the 
total transport volume for this e^xample is 350 units. The procedure 
detailed in Figure 18 was executed for CT«50 with the resultant module 
groupings : 

= {A,D} 
M2 = {B,C,E,F} 
C{M^) = 50 < 50 
ClM^) = 45 <^ 50 

With the organizing completed, the final transport volume was 170. 
This resulted in a savings of 180 units and only 40 units more than 
the absolute minimum of 130 units. If the core constraint is relaxed, 
the minimum transpoxt volume is obtained when all processes are 
grouped into a single module. 

To illustrate combining processes into modules utilizing reorgani- 
zation techniques, consider processes D, E, and F. Modules representing 
each of these processes can be represented by the following COBOL pro- 
cedure divisions. 

PROCESS D 

OPEN INPUT OLD-INVENTORY-FILE, RECEIVING-REPORT-FILE, 
OUTPUT UPDATE-INVENTORY-FILE . 

REWIND RECEIVING-REPORT-FILE . 
LABEL. READ RECEIVING-REPORT-FILE AT END GO TO CLOSER. 

PERFORM UPDATE INVENTORY-FILE. ' 

GO TO LABEL. 
CLOSER. CLOSE ALL FILES. 



PROCESS E 

OPEN INPUT CUSTOMER-TRANSACTION-FILE, OUTPUT CUSTOMER- PAYMENT- 
FILE, CUSTOMER-ORDER-FILE. 

REWIND CUSTOMER-TRANSACTION-FILE AT END GO TO CLOSER. 

IF CODE OF CUSTOMER-TRANSACTION EQUAL 'P' THEN WRITE CUSTOMER' 
PAYMENT-REC FROM CUSTOMER-TRANSACTION ELSE WRITE CUSTOMER- 
ORDER- REC FROM CUSTOMER-TRANSACTION. 
GO TO LABEL. 

CLOSER. CLOSE ALL FILES, 
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PROCESS F 

OPEN INPUT WAREIIOUSE-TRANSACTION-FILE, OUTPUT RECEIVING-REPORT- 
FILE , CUSTOMER-TI^NSACTION-FILE 
REWIND WAREHOUSE-Tr-IANSACTION-FILE 

LADEL, READ WAREIIOUSE-TRANSACTION-FILT^ AT END GO TO CLOSER. 

IF CODE OF WAREHOUSE-TRANSACTION EQUAL 'R' THEN WRITE RECEIVING- 
REPORT-REC FROM WAREHOUSE-TRANSACTION ELSE WRITE CUSTOMER- 
TRANS ACT ION- REC FROM WAREHOUSE-TRANSACTION < GO TO LABEL. 

CL03ER. CLOSE ALL FILES. 

By combining and reorganizing Processes E and F into one module, 
the following integrated module is generated. 

MODULE E-F 

OPEN INPUT WAREHOUSE-TRANSACTION-FILE, OUTPUT RECEIVING-REPORT- 

.FILE, CUSTOMER-ORDER-FILE, CUSTOMER-PAYMENT-FILE . 
REWIND WAREHOUSK-TRANSACTION-FILE. 
LABEL. READ WAREHOUSE-TRANSACTION-FILE AT END GO TO CLOSER. 

IF CODE OF WAREHOUSE-TRANSACTION EQUAT. 'R' THEN WRITE RECEIVING" 
REPORT-REC FROM WAREHOUSE-T.RANSACTION ELSE IF CODE OF WAREHOUSE- 
TRANSACTION EQUAL 'P' THEN WRITE CUSTOMER-PAYMENT-REC FROM 
WAREHOUSE-TRANSACTION . 

WRITE CUSTOMER-ORDER- REC FROM WAREHOUSE-TRANSACTION. 
GO TO LABEL. 
CLOSER. CLOSE ALL FILES. 



Thus, the processes are able to be combined with the elimination 
of the Customer Transaction File (file f"). 

Similarly, Processes D and F can be combined and reorganized to 
eliminate the Receiving Report File (file f). The resultant module 
is as follows: 

MODUBE d F 



Op!eN input WAREHOUSE-TRANSACTION-FILE, OLD- INVENTORY-FILE, 
OUTPUT CUSTOMER-TimNSACTION-FILE, OLD- INVENTORY-FILE. 

REWIND WAREHOUSE-TRANSACTION-FILE . 
LABEL. READ WAREHOUSE-TRANSACTION-FILE AT END GO TO CLOSER. 

IF CODE OF WAREHOUSE-TRANSACTION EQUAL 'R' THEN UPDATE 

INVENTORY-FILE ELSE WRITE CUSTOMER-TRANSACTION-REC PROM 

WAREHOUSE-TRANSACTION. GO TO LABiL. 
CLOSER. CLOSE ALL PILES. 

The ability to combine Processes D and P (eliminating file f) 
and E and P (eliminating file f") does not guarantee that both files 
can be eliminated by grouping Processes D, E, and P. That is, eertain 
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program variable dependencies existing between Processes D and E may 
prohibit the reorganization of the total grouping. However, if such 
dependencies do not exist, then Processes and F may be combined 

and reorganized to produce the resultant module. 

MODULE P_E_F 

OPEN INPUT WAREHOUSE-TRANSACTION-FILE, OLD-INVENTORY-PILE , 

OUTPUT OLD- INVENTORY-FILE, CUSTOMER-ORDER-FILE, CUSTOMER- 
PAYMENT-FILE. 

REWIND WAREHOUSE-TRANSACTION-FILE . 
LABEL. READ WAREHOUSE-TRANSACTION-FILE AT END GO TO CLOSER. 

IF CODE OF WAREH0USE-TR7.NSACTI0N EQUAL 'R' THEN UPDATE- INVENTORY- 
FILE ELSE 

IF CODE OF WAREHOUSE-TRANSACTION EQUAL 'P' THEN WRITE CUSTOMER- 

PAYMENT-REC FROM WAREHOUSE-TRANSACTION ELSE 
WRITE CUSTOMER-ORDER- REC FROM WAREHOUSE-TRANSACTION. GO TO LADEL. 
CLOSER. CLOSE ALL FILES. 
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A methodology is described for the automatic design of a processing 
system initially defined in terms of logical processes or program modules. 
Processes and files are grouped and reorganized in such a way as to 
produce an optimal design with respect to a specific target machine. 
Performance criteria for the optimal design is defined in terms of 
transport volume savings and core memory requirements. 

Starting with a graph theoretic representation of the interaction 
between processes (or modules) and files, the methodology consists of 
two components: (1) a generator of feasible alternatives and (2) a 
procedure for reorganization and code generation for specific groupings. 
The generator for the feasible alternatives uses an implicit enumeration 
algorithm to optimize process groupings in an efficient manner. The 
objective is to group processes into modules which minimize the inter- 
action between modules while still satisfying the logical requirements 
of the program and the physical constraints of the hardware. Finally, 
after the program modules have bean f9e»cified, program and file reorgani- 
zation will be performed to further optimize the design. Reorganization 
includes the combination of similar data passes on the same file to 
minimize transport volume and the merging of loops to enable elimination 
of code and of intermediate data til* 

The code generator will then accent the optimal program design and 
produce an optimized source iancjuage program for the target machine. 
Consequently, not only can an optimal design for the processing system 
be generated; but due to reorganisation techniques, the resultant modules 
(defined from specific process qvoixpinqa) may approach the computational 
efficiency expected of an integrated program. 

Although an automatic reorganizer has not been developed for COBOL, 
one has been implemented for Pilot (a subset of Neliac) on a C.D.C.6500 
at Purdue University. This language (Pilot) could irepresent the inter- 
mediate language L^ into which processes written in COBOL could be 
translated before they are combined and reorganized. 

Another way in which this methodology could be used is to select 
designs that are optimal with respect to a particular pricing scheme. 
For example, the program design which may be exeouted the most effioientlx 
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(with respect to transport volume) on a specific configuration could 
require main memory that would be disadvantageous to the uscsr accordiiig 
to a particular pricing scheme that penalizes the user for larger 
memory requirements. By generating designs for various memory con^- 
straints, such alternative designs are available. 

The methodology described in this paper could be used to break 
up programs into modules or overlays and adds a new dimension to 
program scheduling sincc-i we can now address the following question: 
"What is the optimal size of a pfdgram module?" 
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